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Abstract 

When an advantageous mutation occurs in a population, the favorable allele may spread 
to the entire population in a short time, an event known as a selective sweep. As a result, 
when we sample n individuals from a population and trace their ancestral lines backwards in 
time, many lineages may coalesce almost instantaneously at the time of a selective sweep. We 
show that as the population size goes to infinity, this process converges to a coalescent process 
called a coalescent with multiple collisions. A better approximation for finite populations can 
be obtained using a coalescent with simultaneous multiple collisions. We also show how these 
coalescent approximations can be used to get insight into how beneficial mutations affect the 
behavior of statistics that have been used to detect departures from the usual Kingman's 
coalescent. 

1 Introduction 

Our goal in this paper is to describe the coalescent processes that arise when we consider the 
genealogy of a population that is affected by repeated beneficial mutations. The starting point 
for this analysis will be the continuous-time population model introduced by Moran (1958). In 
this model, the population size is fixed at 2N. Each individual independently lives for a time 
that is exponentially distributed with mean 1 and then is replaced by a new individual. The 
parent of the new individual is chosen at random from the 2N individuals, including the one 
being replaced. Note that we can think of the population as consisting of 2N chromosomes of 
diploid individuals, so each member of the population has just one parent. 

Suppose we sample n individuals at random from this population at time zero. To describe 
the genealogy of the sample, we will define the ancestral process, which will be a continuous- 
time Markov process (^'Ar(t),t > 0) whose state space is the set Vn of partitions of {1, . . . ,n}. 
The ancestral process describes the coalescence of lineages as we follow the ancestral lines of the 
sampled individuals backwards in time. More precisely, ^'7v(0) is the partition of {1, . . . ,n} into 
n singletons, and ^Nit) is the partition of {1, . . . ,n} such that i and j are in the same block of 
^'Ar(t) if and only if the ith and jth individuals in the sample have the same ancestor at time 
—Nt. It is well-known that the process {^N{t),t > 0) is Kingman's coalescent, a coalescent 
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process introduced by Kingman (1982). Kingman's coalescent is a "P^-valued Markov process 
that starts from the partition of {1, . . . ,n} into singletons. All transitions involve exactly two 
blocks of the partition merging together, and each such transition occurs at rate one. 

Within the last decade, progress has been made on describing the genealogy of populations in 
models that allow for natural selection. Krone and Neuhauser (1997) and Neuhauser and Krone 
(1997) studied a model in which each individual can be of type 1 or 2. An individual of type i 
produces offspring at rate Aj, with A2 > Ai so that type 2 is advantageous. Each new offspring 
replaces a randomly chosen individual from the population, and is the same type as its parent 
with probability 1 — un and the opposite type with probability un- Under certain assumptions, 
they show that the genealogy of a sample from the population can be described using what they 
call an ancestral selection graph. Additional work of Donnelly and Kurtz (1999) and Barton, 
Etheridge, and Sturm (2004) has incorporated recombination as well as selection into the model. 

The ancestral selection graph arises in the limit as A?^ — > 00 in the case of weak selection, where 
the selective advantage A2/A1 — 1 and the mutation rates un are 0{1/N). Then, as A" — > cxd the 
fraction of individuals with the favored allele can be approximated by a diffusion process. In this 
paper, we consider strong selection, where the selective advantage is 0(1). With strong selection, 
when a beneficial mutation occurs, there is a positive probability that the beneficial allele will 
spread to the entire population, an event known as a selective sweep. 

At the end of a selective sweep, the entire population has the favorable allele, and every 
member of the population will trace that favorable allele back to the individual that had the 
beneficial mutation that caused the selective sweep. However, the genealogy becomes more 
complicated when we consider recombination. Diploid individuals usually do not inherit an 
identical copy of one of their parent's chromosomes. Instead, the inherited chromosome consists 
of pieces of each of a parent's two chromosomes. Since a chromosome is coming from two places, 
we need to consider the genealogy not of an entire chromosome but of a particular site of interest 
on the chromosome. When a selective sweep is caused by a beneficial mutation at a site other 
than the site of interest, many individuals may trace their gene at the site of interest back to 
the individual that had the beneficial mutation at the beginning of the selective sweep, while 
others may trace their gene at the site of interest to a different ancestor because of recombination 
between the two sites on the chromosome. This effect was first studied by Maynard Smith and 
Haigh (1974), who called it the "hitchhiking effect." 

As wc will show, the typical duration of a selective sweep is only 0(log A). Therefore, when 
we speed up time by a factor of N to define the ancestral process, the selective sweep takes place 
almost instantaneously. Consequently, if we sample n individuals some time after a selective 
sweep and define the ancestral process as before, the ancestral process behaves like Kingman's 
coalescent Tintil wc get back to the time of a selective sweep. At that time, many lineages may 
coalesce because they get traced back to the individual with the mutation that caused the selective 
sweep. This possibility was observed by Gillespie (2000), who referred to the resulting coalescent 
process as the "pseudohitchhiking model." We will show that if selective selective sweeps happen 
repeatedly throughout the history of a population at times of a Poisson process, as proposed 
by Gillespie (2000), then under suitable assumptions the ancestral processes will converge as 
AT ^ 00 to a coalescent with multiple collisions, which is a 7^„-valued Markov process in which 
many blocks of the partition can merge at once into a single block. These coalescent processes 
were introduced by Pitman (1999) and Sagitov (1999). 

While coalescents with multiple collisions are the limiting coalescent processes as N 00, 
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an improved approximation for finite can be obtained using a coalescent with simultaneous 
multiple collisions. Coalescents with simultaneous multiple collisions, which were introduced 
by Schweinsberg (2000) and Mohle and Sagitov (2001), are coalescent processes in which many 
blocks can merge at once into a single block, and many such mergers can occur simultaneously. 
They provide a better approximation than coalescents with multiple collisions in this context 
because, as noted by Barton (1998), Durrett and Schweinsberg (2004a), and Schweinsberg and 
Durrett (2004), multiple groups of lineages can coalesce at the time of a selective sweep. 

Coalescents with multiple or simultaneous multiple collisions arise as limits of ancestral pro- 
cesses in populations that occasionally have very large families because ancestral lines that go 
back to an individual with many offspring will coalesce at the same time. Coalescents with mul- 
tiple collisions arise when a single large family is possible in a given generation, while coalescents 
with simultaneous multiple collisions arise when one generation can contain many large families. 
For more details, see Sagitov (1999, 2003), Mohle and Sagitov (2001), and Schweinsberg (2003). 
The results in this paper provide a different biological application of these coalescent processes. 

The rest of this paper is organized as follows. In section 2, we describe our model for how 
the population evolves when there can be beneficial mutations. We state our main result, which 
is that the genealogy of this process converges to a coalescent with multiple collisions. In section 
3, we present the improved approximation involving a coalescent with simultaneous multiple 
collisions. The next two sections are devoted to applications of these results. In section 4, we 
discuss how multiple mergers affect the number of segregating sites and pairwise differences in a 
sample of DNA. These quantities are used in Tajima's Z)-statistic (see Tajima (1989)), which can 
be used to detect departures from the standard Kingman's coalescent. In section 5 we discuss 
how multiple mergers affect the mimbcr of mutations that appear on just a single individual in 
the sample, which is relevant to the test proposed by Fu and Li (1993) for detecting departures 
from Kingman's coalescent. Our results suggest that Fu and Li's test should have less power to 
detect selective sweeps, at least in large samples, than Tajima's D-statistic. Finally, in section 6, 
we prove the convergence and approximation theorems stated in sections 2 and 3. 

2 Convergence to a coalescent with multiple collisions 

In this section, we give a precise description of our model of a population that experiences 
beneficial mutations, and we state our main convergence theorem. We describe what happens 
following a single beneficial mutation in subsection 2.1, and we consider recurrent beneficial 
mutations in subsection 2.2. Then in subsection 2.3, we state the convergence result and give 
some examples. 

2.1 The effect of a single beneficial mutation 

In this subsection we describe how the population evolves after one of the 2N individuals expe- 
riences a beneficial mutation. We will denote the new favorable allele by B and the other allele 
by h. We assume the relative fitnesses of the two alleles are 1 and 1 — s, so the B alleles will tend 
to survive longer. Immediately after the mutation, one individual has the B allele and 2N — 1 
have the h allele. Kaplan, Hudson, and Langley (1989) and Stephen, Wiehe, and Lenz (1992) 
proposed modeling the fraction of individuals p{t) with the B allele at time t by using the logistic 
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differential equation 

Tliis approach lias been popular in simulation studies. However, Durrett and Schweinsberg 
(2004a) showed that this approximation is not very accurate. Consequently, we will consider 
instead a modification to the Moran model that was studied by Durrett and Schweinsberg (2004a) 
and Schweinsberg and Durrett (2004). 

At one site, each chromosome has a B or b allele, but wc will be interested in the genealogy 
at another neutral site at which all alleles have the same fitness. As in the Moran model, 
each individual survives for a time that is exponentially distributed with mean 1, and then a 
replacement is proposed in which the parent of the proposed new individual is chosen at random 
from the 2A^ members of the population. However, to account for natural selection, whenever 
a replacement of a i? chromosome with a b chromosome is proposed, the change is rejected 
with probability s. Also, to incorporate recombination into the model, we say that when a new 
individual is born, it inherits its alleles at both sites from the same parent with probability 1 — r. 
However, with probability r, there is recombination between the two sites, so the new individual 
inherits its allele at the neutral site from its parent's other chromosome. Because we are treating 
an individual's two chromosomes as two separate members of the population, we model this by 
saying that, with probability r, the new individual inherits the two alleles from two ancestors 
chosen independently at random from the population. 

Suppose the beneficial mutation appears on one chromosome at time 0, and let X(t) be the 
number of chromosomes with the favorable allele at time t. Let r = inf{t : X(t) G {0, 2A'"}} be the 
time at which either the B ox b allele disappears from the population. Suppose we take a random 
sample of n individuals from the population at time r. Let Q be the partition of {1,... ,n} 
such that i and j are in the same block of G if and only if the ith and jth individuals in the 
sample have the same ancestor at time zero when we follow the ancestral lines associated with 
the neutral site of interest. The partition O then describes how the beneficial mutation affects 
the genealogy of the sample. We have the following result concerning the distribution of Q. Here 
Qp,ni for p G [0, 1], is the distribution of a random partition H obtained as follows. First, define a 
sequence of independent random variables (Ci)iLi such that P{^i = 1) = p and P(^i = 0) = 1 — p 
for i = 1, . . . , n. Then define 11 such that one block of 11 consists of {z < n : = 1} and the 
remaining blocks of H are singletons. 

Proposition 2.1. Fix n G N, and fix s G (0,1). Assume there is a constant C such that 
r < C'/{\ogN) for all N. Let a = r\og{2N)/s, and let p = e"". 

1. There exists a positive constant C , depending continuously on s and a but not depending 
on N, such that |P(e = tt\X{t) = 2N) - Qp,niT^)\ < C/(log AT) for all irePn- 

2. Let kq be the partition o/{l,... ,n} into singletons. There exists a constant C, depending 

continuously on s and a but not depending on N , such that P{@ ^ kq and X{t) = 0) < 
CiV-V2. 

Note that in this proposition, the selective advantage s is assumed to be fixed, but the 
recombination probability r depends on N. Part 1 of the proposition, which is a restatement of 
Theorem 1.1 of Schweinsberg and Durrett (2004), implies that as N —>■ oo, the distribution of 6, 
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conditional on the event that a selective sweep occurs, converges to Qp^^ where p represents the 
approximate fraction of lineages that coalesce at the time of the selective sweep. Part 2 of the 
proposition, which we prove in Section 6, shows that lineages typically do not coalesce when the 
favorable B allele dies out. The probability that a selective sweep occurs, and therefore Part 1 
of the proposition applies, is — (1 — s)'^^) (see Durrett (2002) or Schweinsberg and Durrett 
(2004)). 

2.2 A model with recurrent beneficial mutations 

To model a population in which beneficial mutations can occur repeatedly, we assume that 
beneficial mutations at different points on the chromosome occur at times of a Poisson process. 
The selective advantage that these mutations provide and the rate of recombination between the 
site of interest and the site of the mutation will be random. When there is a beneficial mutation 
in the population, the population will evolve as described in the previous subsection. Between 
these times, the population will follow the standard Moran model. 

To be more precise, we will consider the chromosome to be the line segment [—L,L\. Our 
goal will be to describe the genealogy of the site 0. For each N, the beneficial mutations will be 
governed by a Poisson process K]s[ on M x [— L, L] x [0, 1]. If (t, x, s) is a point in .fCAr, then at time 
t, a mutation, which provides a selective advantage of s, will appear at location x on one of the 2N 
chromosomes. The intensity measure of Kj^ will be A x //jv, where A denotes Lebesgue measure on 
M and /xtv is a finite measure on [— L, L\ x [0, 1] which governs the rates of beneficial mutations. The 
recombination probabilities will be determined by a function tm '■ [—L^L] —i- [0, 1]. We assume 
that rjv(O) = and r^v is nonincreasing on [— L, 0] and nondecreasing on [0,L]. Beginning at 
time t, the population will evolve according to the model described in the previous subsection of 
a population with a beneficial allele having selective advantage s and recombination probability 
rAr(x). We let r(t) denote the first time that the beneficial mutation that appears at time t either 
disappears from the population or is present on all 2N chromosomes. 

Let T/v = {t '■ {t, X, s) is a point in Kn for some x and s} be the times at which beneficial 
mutations are proposed. Note, however, that we can not define the evolution of the population 
as explained above if, for some ti,t2 G Z/Vj the intervals [ti,r(ti)] and [t25''"(^2)] overlap. There 
has been some work in the biology literature on the question of how a selective sweep is affected 
by another selective sweep happening at the same time (see, for example, Barton (1995), Gerrish 
and Lenski (1998), and Kim and Stephen (2003)). However, as we will show, in our model this 
overlap occurs too infrequently to have any affect on our results, so we avoid the issue of defining 
the population during periods of overlap by allowing a new beneficial mutation to occur only when 
there is no other beneficial mutation currently in the population. That is, beneficial mutations 
will occur at the times in. = {t E Tn : r(n) < t for all n G T/v such that u <t}. Let 

In= [j [t,T{t)]. 

A beneficial mutation will be present in the population at time u if and only if n G In- For the 
intervals in I^, the evolution of the population was defined in subsection 2.1. For the times in 
M \ Xjv , we will say that the population evolves according to the standard Moran model so that 
the evolution of the population is well-defined for all of R. 

To define the ancestral process = i^Nit),t > 0), we sample n of the 2N individuals 
at random from the population at time zero. We then define ^'Ar(i) to be the partition of 



5 



{l,...,n} such that i and j are in the same block of ^Ar(t) if and only if the ith and jth 
individuals in the sample got their allele at location on the chromosome from the same ancestor 
at time —Nt. Note that we are again speeding up time by a factor of N so that, if there are no 
beneficial mutations (i.e. if ^at is the zero measure), the ancestral process ^'at = (^'Ar(t),t > 0) 
is Kingman's coalescent. When we do have beneficial mutations, the ancestral processes will 
converge as — > oo, under suitable conditions, to a coalescent with multiple collisions. 

2.3 The main convergence theorem and examples 

Pitman (1999) introduced coalescents with multiple collisions, in which many blocks of the parti- 
tion can merge into one. These coalescent processes are in one-to-one correspondence with finite 
measures A on [0, 1], and the coalescent process associated with a particular measure A is called 
the A-coalescent. We will consider here only 7^„-valued coalescents because they are what we 
will need to approximate the genealogy of a sample of size n. However, the constructions can be 
extended, using Kolmogorov's Extension Theorem, to yield coalescent processes that take their 
values in the set of partitions ofN = {l,2,...}. 

Suppose (n„(t), i > 0) is the Pn-valued A-coalescent. Then n„(0) is the partition of {1, . . . , n} 
into singletons. If n„(t) has b blocks, then every possible transition involves merging k of the 
blocks into one, where 2 < k <b. Denoting the rate of this transition by Xb,k, we have 



If A = 6q, where denotes a unit mass at zero, then every transition that involves two blocks 
merging into one happens at rate one, and no other transitions are possible. Thus, the 6o- 
coalescent is Kingman's coalescent. 

The theorem below states that when we do have beneficial mutations, the ancestral processes 
converge as — > oo, under suitable conditions, to a coalescent with multiple collisions. The 
multiple mergers happen at times of selective sweeps. Note that the convergence is in the sense 
of finite-dimensional distributions. Convergence in the stronger Skorohod topology does not hold 
because, during the short time intervals when selective sweeps are taking place, may undergo 
multiple transitions. 

Theorem 2.2. Let /i be a finite measure on [—L,L] x [0,1], and let r : [—L,L] [0,oo) be a 
bounded continuous function such that r(0) = and r is nonincreasing on [—L, 0] and nonde- 
creasing on [0, L]. Suppose that, as N ^ oo, the measures Nfj,]\f converge weakly to n and the 
functions (log2A^)rjv converge uniformly to r. Let r] be the measure on (0,1] such that 



for all y G (0, 1]. Let A be the measure on [0, 1] defined by A = Sq -|- Aq, where Ao((ix) = x'^ri{dx). 
Let n = (n(t),t > 0) be the Vn-valued A-coalescent. Then, as N ^ oo, the finite- dimensional 
distributions of^]\f converge to the finite- dimensional distributions ofU. 

Note that in Theorem 12. 2( the recombination probability is 0(l/(log A^)). The function r is 
assumed to be monotone on [—L, 0] and [0, L] because the greater the distance between and the 
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site of the mutation, the greater the hkehhood of recombination between the two sites. Also, the 
rate of beneficial mutations is 0{1/N), so that the multiple mergers caused by selective sweeps 
and the ordinary mergers of two lineages at a time are happening on the same time scale. If the 
rate of selective sweeps were o(l/A^), then the multiple mergers would disappear in the limit. If 
selective sweeps occurred on a faster time scale than 0{1/N), then the multiple mergers would 
dominate for large and the limiting coalescent would have no 60 component. Gillespie (2000) 
considers this possibility and proposes that it may explain why observed genetic variation does 
not appear to be as sensitive to population size as Kingman's coalescent model predicts. However, 
in this paper we focus on the case in which both types of mergers happen on the same time scale. 
We now derive the limiting coalescent with multiple collisions in two natural examples. 

Example 2.3. Consider the case in which we are concerned only with mutations at a single site, 
all of which have the same selective advantage. Fix q > 0, and let fj,]\f = OiN~^5(^z,s) some 
s G (0, 1] and z G [—L, L]. This means that beneficial mutations that provide selective advantage 
s appear on the chromosome at site z at times of a Poisson process. The measures Nfi^ converge 
to n = Oi6(^z,s)- Assume that the recombination functions r^r are defined such that the sequence 
(log2A^)rAr converges uniformly to r, and let /? = r(z). Then, for all y G (0, 1], we have 

ViivA]) ^ j ^^l{e-'-W/">y} f^idx X du) = Sal^^-0/s>yy 

Therefore, rj consists of a mass sa at p = e~^/*. It follows from Theorem 12.21 that the limiting 
coalescent process is the A-coalescent, where A = 5o + sap^dp. Thus, in addition to the mergers 
involving just two blocks, we have coalescence events at times of a Poisson process in which we 
flip p-coins for each lineage and merge the lineages whose coins come up heads. 

Example 2.4. It is also natural to consider the case in which mutations occur uniformly along 
the chromosome. For simplicity, we will assume that the selective advantage s is fixed. Let 
A denote Lebesgue measure on [— L,L]. Suppose fiN = N~^{aX x 6s), so the measures Njj.^ 
converge to ^ = a A x 6s- To model recombination occurring uniformly along the chromosome, 
we assume that the functions (log2A^)rjv converge uniformly to the function r{x) = P\x\, so 
the probability of recombination is proportional to the distance between the two sites on the 
chromosome. For all y G (0, 1], we have 

Vi[y, 1]) j ^ l{e— (-)/->s/} dx = as j ^ l{e-/3|-l/=>2;} dx. 
Since e"^'^'!/** > y if and only if \x\ < —{s/f3){logy), we have 

viivA]) = min | ^"'^^ ^ , 2asL|. 



Therefore, for y > e we have 



Let c = 2as^/l3. It follows that 77 has a density given by gL{y) = cjy for e ^^1^ < y < 1 
and giiy) = otherwise. By Theorem 12.21 the finite-dimensional distributions of the ancestral 
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processes converge to those of the A-coalescent, where A = (^o + and Aq has density 
hL{y) = iP'gLiy)- Note that as L — > cxd, the density hi{y) converges to h{y), where h{y) = cy for 
y € [0, 1] and h{y) = otherwise. We can think of this as the hmiting coalescent for an infinitely 
long chromosome. 

Example 2.5. Finally, we show that any A-coalescent with a unit mass at zero can arise as a 
limit of ancestral processes in this model. We first show how to obtain coalescents of the form 
A = (5o + Aq, where Aq is a finite measure on [e, 1] and < e < 1. Note that in Theorem 12.21 
we have Ao((ix) = x'^r]{dx), so it suffices to show that /x and r can be chosen to make t] an 
arbitrary finite measure on [e,l]. Let G : [e, 1] — > [0, oo) be any nonincreasing left-continuous 
function. We will choose fi and r so that r]{[y, 1]) = G{y) for e < y < 1 and r/([0, e)) = 0. Let 
L = — ^loge, and let u be the measure on [— L, L] such that z^([— L,0)) = and, for e < y < 1, 
i/([0, — ^ log y]) = 2G{y). Suppose r{x) = \x\ and ^ = u x 6i/2- Then, for e < y < 1, 

V{[y, ^ j ^l{e-'-W/»>y} Kdx X ds) 
1 /-^ 1 
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l{e-2->3;} J^{dx) = -i/([0,-(logy)/2]) = G(y), 



as claimed. Thus, we can get the A-coalescent in the limit if Ao((0,e)) = 0. We can obtain an 
arbitrary A-coalescent by then taking a limit as L — > oo (or e J, 0) as in Example 12.41 



3 Approximation by a coalescent with simultaneous multiple col- 
lisions 

A key ingredient in the proof of Theorem 1221 is part 1 of Proposition l2.lL Part 1 of Proposition l2.1l 

says that, up to an error of 0(l/(log A^)), we can approximate the effect of a selective sweep on 
the genealogy by flipping a p-coin for each lineage and merging the lineages whose coins come up 
heads. However, Durrett and Schweinsberg (2004a) observed in simulations that for A'^ between 
10,000 and 1,000,000, the approximation in Proposition 12.11 works poorly, largely because it is 
possible for multiple groups of lineages to coalesce at the time of a selective sweep. By taking 
this into account, they were able to give a more complicated approximation that works much 
better in simulations and has an error of only 0(1/ (log A^)^). 

Before stating this result, we review Kingman's (1978) paintbox construction of exchangeable 
random partitions of {1, . . . , n}. Let 

oo 

A = {{xi,X2, . . .) : xi> X2 > ■ ■ ■ >0,^Xi <l], 

i=l 

and let G be a probability measure on A. We define a G-partition n of {1, . . . , n} as follows. Let 
Y = (11,12, . . . ) be a A-valued random variable with distribution G. Define a sequence {Zi)f^-^ 
to be conditionally i.i.d. given Y such that P{Zi = j\Y) = Yj for all positive integers j and 
P{Zi = 0\Y) = 1 — X]jt=i ^j- Then define H to be the partition such that distinct integers i and 
j are in the same block if and only if Zi = Zj > 1. We denote the distribution of a G-partition 
of {1, ... , n} by Qcn- Note that if G is a unit mass at (p, 0, 0, . . . ), then Qcn = Qp,n- 



8 



Next, we define a family of distributions R{6, M) on A by using a stick-breaking construction. 
Let G [0,1], and let M be a positive integer. Let (VFfc)^2 be independent random variables 
such that Wk has a Beta(l, k — \) distribution. Let {C,k)^=2 ^ sequence of independent random 
variables such that P(Cfc = I) = 9 and P(Cfc = 0) = 1 - 6^ for all k. For A; = 2, 3, ... , M, let 

= Cfe^fe- To perform the stick breaking, we first break off a fraction Wm of the unit interval, 
then break off a fraction Wm-i of what is left over, and so on until we get down to W2- For 
A; = 2, . . . , M, the length of the fcth fragment is Yj^ = Iljlfe+i ~ ^^"^ length of the first 
fragment is Yi = \{f^^{l - Vj). Note that Y.k=i ^k = 1- Let Y = (Yi, Yg, . . . , ^m, 0, 0, . . . ) G A 
be the sequence obtained by ranking the interval lengths Yi , . . . , Ym in decreasing order and then 
appending an infinite sequence of zeros. Finally, let R{6, M) be the distribution of Y . 

These distributions R{9,M) were studied in Durrett and Schweinsberg (2004b), who used 
them to approximate the distribution of family sizes in a Yule process with infinitely many types. 
They arise in the proposition below because, after a beneficial mutation, the number of lineages 
with the B allele that do not eventually die out can be approximated by a Yule process. The 
result below is Theorem 1.2 of Schweinsberg and Durrett (2004). 

Proposition 3.1. Fix n G N, and fix s & (0,1)- Assume there is a constant C such that 
r < C'/{logN) for all N. Let a = rlog{2N)/s, and let p = e~". Then there exists a positive 
constant C , depending continuously on s and a hut not depending on N , such that 

\P{Q = 7r|X(r) = 2N) - QR(^r/s,V2Ns\),n{^)\ < C/{\ogNf 

for all n EVn, where [mj denotes the greatest integer less than or equal to m. 

Because the improved approximation allows many groups of lineages to coalesce at the time 
of a selective sweep, this result suggests that, for finite N, a coalescent with simultaneous mul- 
tiple collisions should provide a better approximation of the ancestral process than a coalescent 
with multiple collisions. Coalescents with simultaneous multiple collisions, which were studied 
by Mohlc and Sagitov (2001), Schweinsberg (2000), and Bertoin and Le Gall (2003), have the 
property that many blocks can merge at once into a single block, and many such mergers can 
occur simultaneously. Coalescents with simultaneous multiple collisions are in one-to-one corre- 
spondence with finite measures S on A. 

Suppose TT is a partition of {l,...,n} whose blocks are Bi, . . . , Bm, and suppose tt' is a 
partition of {1, . . . , n'} with n' > m whose blocks are B[, . . . , B'f^. Following Bertoin and Lc Gall 
(2003), define the coagulation of vr by it' to be the partition whose blocks are given by Uipr' 
for z = 1, . . . , /c. Suppose (n„(i),i > 0) is the P„-valued S-coalescent. If there are b blocks at 
time t— and a merger occurs at time t, then there exists a unique partition tt E Vb such that 
Unit) is the coagulation of n„(f— ) by vr. If vr has r + s blocks, s of which are singletons and 
the other r of which have sizes ki, . . . ,kr > 2, where b = ki + ■ ■ ■ + kr + s, then the rate of this 
transition is oo i 

h;ku-,kr;s = Qs.,bi^) ^ xfj Eo{dx) + al^r=iM='2}^ (3-1) 

where 6x denotes a unit mass at x = {xi,X2, ■ ■ ■) G A and S has been written as a(^(o,o,...) + 
with Ho({(0, 0, . . . )}) = 0. Coalescents with multiple collisions are a special case in which S is 
concentrated on points in which only the first coordinate is nonzero. 
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Coalescents with multiple and simultaneous multiple collisions can be constructed from Pois- 
son point processes (see Pitman (1999) and Schweinsberg (2000)). Consider a Poisson process on 
(0, oo) X Vn whose intensity measure is the product of Lebesgue measure on (0, oo) and a measure 
L on Vn defined as follows. Let S C Vnhe the set of all partitions consisting of one block of size 
2 and n — 2 singletons. If vr € Vn, let L^tt) = if vr is the partition consisting of n singletons. 
Otherwise, let 

L{7t) = I^Qs^^n{Tr)(^J2x]^ Ho((ix)+al|,e5}- (3-2) 

Since L is a finite measure, it is easy to define n„ = (n„(t),t > 0) such that n„,(0) is the 
partition consisting of n singletons and, at the times of points (t, vr) of the Poisson point process, 
the partition n„(t) is the coagulation of n„(t— ) by vr, and these are the only jump times of n„. 
This coalescent process is the valued H-coalescent. The construction of the A-coalescent is 
the same, except that if vr has at least one block that is not a singleton, we define 

L(vr) = / Qp,„(vr)p~2 Ao(dp) + al{^g5}, (3.3) 
Jo 

where A = (^o + Aq and Ao({0}) = 0. 

Under some additional assumptions, most significantly restricting the selective advantage 
resulting from each beneficial mutation to be at least e > 0, we are able to obtain bounds 
on the difference between the finite-dimensional distributions of ^'at and the finite-dimensional 
distributions of the approximating coalescent process. Proposition 13.21 below shows that indeed 
the coalescent with simultaneous multiple collisions gives a more accurate approximation. 

Proposition 3.2. Let ji he a finite measure on [—L,L\ x [e, 1], where e > 0, and let r : [—L,L\ 
[0, 1] he a function such that r(0) = and r is nonincreasing on [—L, 0] and nondecreasing on 
[0,L]. Suppose that, for all N, we have fijy = N^^fi. Also, assume that r]y{x) = r{x)/ log{2N) 
for all N and x. Fix times < -ui < • • • < Um, and let tti, . . . , iim G Vn- 

1. Define r] and A as in Theorem Let 11 = (n(t),t > 0) he the Vn-valued A-coalescent. 
Then there exists a constant C such that 

(J 

\P{'^N{ui) = TTi for i = 1,. . . ,m) - P(n{ui) = ni for i = 1, . . . ,m)\ < 



IorN 



2. Let Gn he the measure on A such that for all measurable subsets A C A, we have 
Gn{A)= [ [ sR{rNix)/s,l2Ns\){A) fi{dx X ds). 

J-L Jo 

Let Sjv be the measure on A given hy = (^(o,o,...) + "Af,0; where '^n,o is defined by 
E]\ffi{dx) = {Yl'jLix'j)G]\f{dx). Let Tn = {T]\j{t),t > 0) he the Vn-valued Epf -coalescent. 
Then there exists a constant C such that 

C 

\P{'^N{ui) = TTi for i = 1, . . . ,m) - PiTN{ui) = iTi for i = 1, . . . ,m)\ < 



(logiV)2 
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4 Segregating sites and pairwise differences 



One motivation for modeling a population that experiences recurrent selective sweeps by coales- 
cents with multiple or simultaneous multiple collisions is that these coalescent models can provide 
insight into tests used to detect selective sweeps. In view of part 2 of Proposition 13.21 and the 
simulation results in Durrett and Schweinsberg (2004a), there should be little loss of accuracy in 
studying the behavior of these tests under the assumption that the genealogy of a sample follows 
a coalescent with simultaneous multiple collisions. One commonly used test is based on Tajima's 
D-statistic (see Tajima (1989)). Given a sample of n strands of DNA from the same region on 
a chromosome, let Ajj be the number of sites at which the ith and jth segments differ, and let 

~ (2) ^ average number of pairwise differences over the (2) possible pairs. 

Let Sn be the number of segregating sites in the sample, that is, the number of sites at which at 
least one pair of segments differs. Tajima's D-statistic compares the statistics A„ and Sn- 

Suppose the ancestral history of a sample of individuals is given by a coalescent with 
multiple or simultaneous multiple collisions. Let Af, be the total rate of all mergers when the 
coalescent has b blocks. Assume that, on the time scale of the coalescent process, mutations 
happen at rate 9/2. Any mutation on the ith. or jth lineage before these lineages coalesce will 
cause the ith and jth segments to differ at some site. Since the expected time for these lineages 
to coalesce is A^"^, we have E[Aij] = OX^^. Therefore 

E[An] = 0X2'. (4.1) 

Note that A2 = A([0, 1]) for coalescents with multiple collisions and A2 = S(A) for coalescents 
with simultaneous multiple collisions. 

To calculate the expected number of segregating sites, we note that any mutation in the 
ancestral tree before all n lineages have coalesced into one adds to the number of segregating 
sites. If, at some time, the coalescent has exactly b blocks, the expected time that the coalescent 
has b blocks is . Let Gn{b) be the probability that the coalescent, starting with n blocks, will 
have exactly b blocks at some time. Then 

E[Sn] = -Y,bX^'Gnib). (4.2) 

b=2 

Although we do not have a closed- form expression for Gn{b), these quantities can be calculated 
recursively because (|2.1() and 1)3.11) allow us to express Gn{b) in terms of Gk{b) for k < n. As a 
result, it would not be difficult to evaluate the expression in ()4.2j) numerically. 

Suppose the ancestral process is given by Kingman's coalescent, which would be the case 
if there were no selective sweeps. Then A^ = (2) for all b > 2. Also, the number of blocks 
never decreases by more than one at a time, so Gn{b) = 1 whenever 2 < b < n. It follows that 
E[An] = 9 and 

^[5n] = |EK2 =^E53T = ^^"-i' (4.3) 

6=2 ^ ^ 6=2 

where hn-i = Y17=i / ''') • Thus, E[An — Sn/ hn-i] = 0. This observation is the basis for Tajima's 
Z'-statistic, which is given by 

D = ^n-Sn/hn _^ ^^^^^ 

\/ O-nSn + bnSn{Sn ~ 1) 
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where Un and 5„ are somewhat comphcated constants that are chosen to make the variance of D 
approximately one when the ancestral tree is given by Kingman's coalescent. See section 4.1 of 
Durrett (2002) for details. 

After a selective sweep, the new mutants will tend to have low frequency. As a result, a recent 
selective sweep should decrease A„ more than Sn, causing the numerator of Tajima's L)-statistic 
to be negative. Braverman et. al. (1995) found in simulations that Tajima's £)-statistic indeed 
tends to be negative after a selective sweep. Simonsen, Churchill, and Aquadro (1995) studied 
this question further and argued that unless the selective sweep was recent, Tajima's /^-statistic 
had relatively little power to detect selective sweeps. See also Przeworski (2002), who discusses 
the power of Tajima's I?-statistic to detect selective sweeps. Our coalescent approximation allows 
us to obtain the following result regarding the expected number of segregating sites when the 
population experiences recurrent selective sweeps. 

Proposition 4.1. Consider a A-coalescent in which A = 5q + Aq, where Ao({0}) = 0, or a 
r,- coalescent in which H = 5(o,o,... ) + "O o-''^d. Ho({(0, 0, ...)}) = 0. Let = \ — (2) • Suppose 

^ at log 6 

^— ^<oo. (4.5) 

b=2 

Then, there exists a constant p > such that 

lim E[Sn] - Ohn-i = -p. (4.6) 

n— >oo 

Furthermore, defining Goo(&) = hnin-^oo CnC^), we have 

^ = ^ E ^((2) ' - V') + { £ ^'^"(^ - ^^-'^ 



The condition (|4.5() prevents Aq or Hq from having too much mass near zero. Note that 
(|4.1|) implies that -E'[A„] decreases by a constant as a result of the beneficial mutations, while 
Proposition 14.11 implies that when (|4.5|) holds, E[Sn/hn-i] decreases by approximately p/hn-i, 
which is 0(l/(logn)). Therefore, Proposition 14.11 shows that for sufficiently large samples we do 
expect Tajima's D-statistic to be negative when the population is affected by recurrent selective 
sweeps. Before proving this proposition, we consider some examples. 

Example 4.2. Suppose, as in Example 12.31 we have a A-coalescent in which A = 5q + sap~'^6p. 
Since p-mergers occur at rate sa, we have Xb < (2) + sa and thus Of, < sa for all b. Condition 
(|4.5|) follows immediately. 

Suppose instead we have the A-coalescent of Example 12.41 where A = 5o + ^0 and Aoldx) = 
cx dx. Note that a?, is the same as the total merger rate of the Ao-coalescent when there are h 
blocks. Using the fact that if Z ~ Binomial(6, x) then P{Z >2) = l-{l-xf- bx{l - xf''^, 
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we have 

Jo 

c [ (1 - (1 - x)'' - bx{l - xf-^)x-^ dx<c I (1 - (1 - x)'')x~^ dx 



ab= / {I - {1 - xy - bx{l - xy-^)x-^ Ao{dx) 
'o 



i/b fl 

(1 - (1 - xf)x-'^ dx + c (1 - (1 - x)^)x"^ dx 

Ji/b 



< c 



r-l/b r-l 

I bdx + c X dx = c{l + logb), (4.: 

Jo Jl/b 



which imphes (|4.5|) . 

Example 4.3. Although (|4.5() holds in the natural cases given in Examples 12.31 and l2.41 we show 
here that it does not hold for all coalescents. Suppose A = + -'^O; where Aq is the uniform 
distribution on (0, 1). Note that there exists a constant C > such that if Z ~ Binomial(6, x) 
with X > 1/6 and 6 > 2, then P{Z >2)>C. Therefore, 

ab= [ (1 - (1 - x)'' - 5x(l - x)''-^)x-2 dx 
Jo 

>[ {1 - {1 - xf - bx{l - xf-^)x-^ dx 



i/b 

/b 



>C [ x-^ dx = C{b-l), 
Jl/b 



so (|4.5|) does not hold in this case. 

Proof of Proposition \4-l\ When the coalescent has n+1 blocks, the probability that the next coa- 
lescence event will take the coalescent down to fewer than n blocks is at most [A^+i — ("2 ^)]/A„+i. 
Therefore, if 2 < 6 < n, then 

\G^^i{b) - < ^^££1 = ^ < (4.9) 

A„+i A„+i n(n + 1) 

Therefore, when (|4.5I) holds, the sequence is Cauchy and thus has a limit Goo{b). 

It follows from and (jl^ that 

" /L\ -1 



E[Sn]-eK^r = -Y^bX.'Gnib) - 2 EH2 

b=2 b=2 ^ ^ 

h — O \ \ / / ir, 



+ ^^bX,\Gnib)-G^{b)). (4.10) 



b=2 



To prove ProDosition l4.1l we need to take the limit as n — > 00 of the three terms on the right-hand 
side of Km . 
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For the first term, we note that 



J A, - \2j 62(5 _ 1)2' 
Therefore, when (|4.5|) holds, we have a summable series and 



-J|:<v-G)->-;|<G)-'-v). 

For the second term, note that (|4.9|) and the fact that Gb{b) = 1 imply 

OO OO ^ OO ^ 

i:'V'(i-G.(.),<E^ 

6=2 6=2 ^m=b ^ 

(l + log(m- 1)) 

which is finite by H4.5p . Therefore, 

„ n „ OO 

6=2 6=2 

Finally, for the third term, 

n n ^ OO ^ 

limsup^6A,-i|G„(6)-Goo(6)| <limsupj;-- ^ ,TiJ 
n^oo f-:' n^oo f-;^ 6 - 1 V "i(m + 1) / 



m=n 

OO 



< lim ^— V ( V 

~ ^^J^^ logn 6-1 V ^ 



2a^+i log m 
_ , m(m + 1) 

6=2 m=n ^ ' 



2(1 + log(n - 1)) Y> 1am+\ log ra 
logn m(m + 1) 

m=n ^ ' 

by The proposition follows from dimj) . (EHU), (EUl), and (I^T^ . □ 



n— >oo 



< limsup "^^^ "^^""^^^ y """;+^^"^'" = (4.13) 

" logn ^-^ m(m + 1) 



5 The number of singletons 

Fu and Li (1993) proposed another test to detect departures from Kingman's coalescent. They 
considered the ancestral tree in which the leaves are the n individuals in the sample. They defined 
the branches connecting a leaf to an internal node to be external branches and the other branches 
to be internal branches. Let r\f. denote the number of mutations on external branches, and let r]i 
be the number of mutations on internal branches. Every mutation produces a segregating site, 
so r](, + r]i = Sn- If a mutation occurs on an external branch, the mutant gene appears on just one 
of the n individuals in the sample, while if a mutation occurs on an internal branch, the mutant 
gene appears on between 2 and n — 1 of the individuals in the sample. Therefore, to determine rj^, 
we simply count the number of mutations that appear on just one of the sampled chromosomes. 
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Note that unless an outgroup is available, it will not be possible to distinguish between a mutation 
that appears on one of the sampled chromosomes and a mutation that appears on n — 1 of the 
sampled chromosomes. Fu and Li (1993) proposed a modification of their test for when there 
is no outgroup, but for the analysis in this section, we assume that we have an outgroup that 
enables us to make this distinction. 

Let Jn be the sum of the lengths of the external branches. In terms of the associated coalescent 
process, J„ is the sum, over i between 1 and n, of the amount of time that the integer i is in 
a singleton block. Let be the sum of the lengths of the internal branches. Assuming, as 
before, that mutations occur at rate 9/2 on the time scale of the coalescent process, we have 
E[ve\Jn] = {0/2) Jn and E[ri,\In] = {e/2)In. 

Fu and Li's L'-statistic is based on comparing r]i with — l)r/e. Note that r/j — — l)r/e = 

Sn ~ hn-iije- To See that this has mean zero when the ancestral tree is given by Kingman's 
coalescent, we follow the explanation on p. 163 of Durrett (2002). In the case of Kingman's 
coalescent, (|4.3j) gives -E[5'„] = Ohn-i- Therefore, E[Sn — hn-ir]e] = 9hn-i — 9hn-iE[Jn]/2, so 
it remains to show that E[Jn] = 2. Let Kn be the amount of time that the integer 1 is in a 
singleton block of the partition, so E[Jn] = nE[Kn]- Let be the amount of time before the 
first coalescence event, and note that -©[T^] = 2/[n{n — 1)]. The probability that 1 coalesces with 
another integer at time T„ is 2/n, and this event is independent of T^. If 1 does not coalesce at 
this time, then the expected additional time that 1 is a singleton is E[Kn-i]- Therefore, we get 
the recursion 

r n 2 , , n-2 , , 2 n-2 , 

E[Kn] = -E[Tn] + E[Tn + Kn-l] = -r + E[Kn-l]. 

n n n{n — 1) n 

Note that -B[-f^2] = 1) and then it is easy to show by induction that i?[-fCn] = 2/n for all n, and 
so -EfJn] = 2 for all n, as claimed. 

We can write Fu and Li's L'-statistic as 

n= Sn-K-^ rj^ ^^^^^ 

where, as in 1)4.4(1 . c„ and d„ are constants chosen to make the variance of the statistic ap- 
proximately one when the genealogy is given by Kingman's coalescent. Details of the variance 
computation are given in section 4.2 in Durrett (2002), where an error of Fu and Li (1993) is 
corrected. 

When multiple mergers cause many lineages to coalesce at once, one expects /„ to be reduced 
more than J„ because there is still an external branch associated with each leaf, but there are 
fewer internal branches because of multiple mergers. This would cause Fu and Li's -D-statistic to 
be negative. The next proposition shows that this is indeed the case. 

Proposition 5.1. Let (n„(t),t > 0) be a Vn-valued A-coalescent in which A = 6o + Aq, where 
Ao({0}) =0, or a Vn-valued r.- coalescent in which H = (^(o,o,... ) + "O o,nd Ho({(0, 0, . . . )}) = 0. 
Let Of, = Xb — (2) , and suppose ( |^.5| ) holds. Then 

lim E[Sn - hn-ir]e] = -p, (5.2) 

71— »00 

where p is the constant defined in \4-'A )- 
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The key to the proof of this proposition is the following lemma. 
Lemma 5.2. Under the assumptions of Provosition [^ni there is a positive constant C such that 



0<i?[2-J„]<-V^ (5.3) 

6=2 



for all n>2. 



The first inequality in H5.3() . which does not require condition H4.5|) . shows that the expected 
sum of the lengths of the external branches is never greater than 2, which means that it is 
largest for Kingman's coalescent. The second inequality gives a rather sharp bound on the 
difference. Recall that in Example 12.31 we have < sa, so E[2 — J„] < C(log n)/n for some 
other constant C. In Example 12. 4| (|4.8|) gives aj, < c(l + log 6) < c(l + logn), which implies 
E[2 — Jn] < C"(logn)^/n for some constant C". Thus, in these examples, the lengths of the 
external branches are affected very little by multiple mergers when the sample size is large. The 
reason is that, in large samples, a lot of coalescence occurs very quickly, so most ancestral lines 
have merged with at least one other ancestral line before the first multiple merger takes place. 

Proof of Lemma \5.'A We start by proving the first inequality in ()5.3() by induction. As before, 
let Kn be the amount of time that the integer 1 is in a singleton block. We need to show that 
< 2/n for all n > 2. First, note that -E'[-ftr2] = ^ 1- Now, suppose for some n > 3, 
we have E[Kj\ < 2/j for j = 2, . . . ,n — 1, and consider E[Kn]. Let r„ be the time of the first 
merger when the coalescent starts with n blocks, and let i? > 2 be the number of blocks involved 
in the merger at time T„. Note that B is independent of T„. Conditional on B, the probability 
that 1 merges with at least one other block at time Tn is B/n. If this does not happen, then at 
least n — B + 1 blocks remain after the merger, so by the induction hypothesis, the expected time 
after T„ that {1} will remain a singleton is at most 2/(n — B + 1). Therefore, 

\n J \ n J\ n — B + lJ n[n — B + 1) 

Since 2<B <n, we have in-B)/{n-B + l) < (n-2)/(n-l). Also, E[Tn] = A^^ < 2/[n(n-l)], 
so 

^r.. 1 2 2(n-2) 2 

E[Kn] < -, + 4 -( = -, 

n{n — 1) n[n — 1) n 

which proves the first inequality. 

The proof of the second inequality requires a coupling argument. Let (n„(f),f > 0) be 
the coalescent process defined in the statement of Proposition 15.11 and let (T„(i),i > 0) be 
Kingman's coalescent, started from the partition of l,...,n into singletons. We may assume 
that the coalescent processes n„ and T„ are constructed from Poisson processes A^i and N2 
respectively on (0, cxd) x "P^, as described in section 3. That is, whenever (t, vr) is a point of 
A'"!, the partition n„(t) is the coagulation of n„(t— ) by vr, and whenever (i, tt) is a point of A''2, 
the partition Tn{t) is the coagulation of T„(t— ) by vr. Furthermore, these are the only jump 
times of n„ and T„. Let Li and L2 be the intensity measures of the second coordinate for the 
Poisson processes A'"i and N2 respectively. Then, for vr € Vn-, we have -^2(71") = 1 if tt consists 
of one block of size 2 and n — 2 singletons, and L2{tt) = otherwise. Also, Li{tt) > L2(vr) for 
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all TT E Vn- Therefore, we may assume that the Poisson processes A^i and are coupled such 
that if (t, tt) is a point of then (t,vr) is a point of A^i. The points (t, vr) in both A'^i and A''2 
correspond to mergers in which two blocks coalesce at a time, while the points (t, tt) in A^i but 
not correspond to multiple mergers caused by selective sweeps. 

To compare the two processes, note that = inf{t : {1} is not a singleton in n„(t)}, and 
let = inf{t : {1} is not a singleton in T„(t)}. We have E[Jn] = nE[Kn]- By our previous 
results for Kingman's coalescent, we have E[K!^] = 2/n, and so E[2 — J„] = nE[K'^ — Let 
r = inf{t : n„(i) ^ T„(f)}, where we say r = oo if n„(t) = T„(i) for all t. For vr € Vn, denote 
by |7r| the number of blocks in vr. Since n„(t) = T„(t) for all t < t, we have 

n 

E[2 - Jn] = nE[K'^ - Kn] < nE[{K'^ - t)1{,<k'J =nY,E[{K " T)l{.<x;}l{|T„(r)|=6}]- 

6=2 

For 6 = 1,2, ... ,n, define U = mf{t : |T„(t)| = b}. li t < K'^ and |T„(r)| = 6, then K'^ > Tb. 
Therefore, 

n 

E[2 - J„] < nY,E[K - r\{T < K'J n {|T„(t)| = b}]P{{K > n} n {|T„(r)| = 6}). (5.4) 

6=2 

If T < and |T„(t)| = b, then {1} is one of b blocks of T„(r), and by our previous results 
on Kingman's coalescent, the expected time before it merges with another block is 2/6. Thus, 
we have 

E[K - r\{T < K) n {|T„(r)| = b}] = ^. (5.5) 

Note that K'^ > whenever {1} remains a singleton at the time that Kingman's coalescent is 
down to b blocks. Whenever the coalescent goes from j blocks to j — 1, the probability that the 
integer 1 is involved in the merger is 2/j, so 

PiK>W-^{^-f) S»p(- t j) <exp(l-2/"i..) .e(^)^. (5.6) 

If |T„(t)| = 6, then both n„ and T„ have the same b blocks at time T^^ but at time r the 
process n„ has a transition but T„ does not. Since the total merger rate for n„ after time is 
= OLh+ (2) and the total merger rate for T„ after time is (2) , we have 

P(|T„(r)| = b\K >n)<^^< (5.7) 



Combining f|5.4p - (|5.7jl . we get 



E[2-J.]<n^ <-^-, 

h—o \ ' h—O 



6=2 ' ' 6=2 

which is the second inequality in (|5.3jl . □ 
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Proof of Proposition \5.1l We have 



E[Sn-hn.i7]e\ = {E[Sn] - OK-i) + K^i{e - E[rje]) = - ) + ^^^[2 - J„] . (5.8) 

By Proposition 14. 11 lim„_+oo(-E'[S'„] — = —p. It thus remains only to show that the second 

term on the right-hand side of ()5.8|) goes to zero as n — >• cxd. Let e > 0. By (|4.5I) . there exists a 
positive integer such that 

a6(l + log b) 



E 

b=N 



62 



< e. 



n „, n / N n 



Therefore, by Lemma 15.21 

hn-lO T 1 ^ T Chn-l9 sr^ at, Chn-l9 / s-^ at sr^ CXb\ 

hmsup -^E[2 - Jn] < hmsup T = ^^^^P Orv ( T + 2^ T 

0=2 0=2 0=^ 

< + — hmsup 2^ — -— < — hmsup p < — . 

0=^ 0=^ 

Since this is true for all e > 0, and since E[2 — J„] > for all n by Lemma l5.2| we have 

lim ^E[2 - A] = 0, 

which completes the proof of the proposition. □ 

We conclude this section with some comments about the power of Tajima's D-statistic and 
Fu and Li's Z?-statistic to detect selective sweeps. The numerators of these two statistics, which 
are A„ — Sn/hn-i and Sn — hn-irje , each have mean zero when the ancestral process is Kingman's 
coalescent. The expected values of these two numerators both converge to a negative constant 
as the sample size goes to infinity when multiple mergers can occur. These statistics are used to 
test for departures from Kingman's coalescent. If the goal is to test for multiple mergers caused 
by selective sweeps, one would reject the null hypothesis of no selective sweeps if the value of the 
statistic is too small (i.e. more negative than would be expected with Kingman's coalescent). 

A natural question, then, is how much power these tests have to detect selective sweeps. While 
a full analysis of this question would require a simulation study, we can obtain some insight from 
the analytical results presented above. From the values of a„ and 6„ in (|4.4j) . which can be found 
in section 4.1 of Durrett (2002), we see that the standard deviation of the numerator of Tajima's 
U-statistic is 0(1) when the genealogy is given by Kingman's coalescent. However, from the 
values of c„ and dn in 1)5. 1|) . which can be found in section 4.2 of Durrett (2002), we see that the 
numerator of Fu and Li's -D-statistic has a standard deviation which is 0(log n). This means that, 
for large n, moderate negative values for the numerator of Fu and Li's -D-statistic are not strong 
evidence against the null model of Kingman's coalescent, and thus a test based on Fu and Li's 
D-statistic will most likely have low power. These observations are consistent with simulation 
results of Simonsen, Churchill, and Aquadro (1995), who found that Tajima's D-statistic has 
more power to detect selective sweeps than Fu and Li's D-statistic. 

Neither of these tests has the desirable feature of many tests in classical statistics, which 
is that for all a > 0, the power of the level a test tends to 1 as the sample size n tends to 
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infinity. Indeed, for the problem of detecting recurrent selective sweeps, no such test based on 
the genealogy of the sample can exist because, with positive probability, none of the selective 
sweeps affects the genealogy of the n sampled lineages before we get back to the most recent 
common ancestor. We formulate this observation precisely in the following proposition, which 
uses the coupling in the proof of Lemma 15.21 

Proposition 5.3. Let (n„(t),t > 0) be the A-coalescent or E-coalescent defined in the proof of 
Provosition \5. 11 and assume that 

oo 

J]^<oo, (5.9) 

b=2 

which is slightly weaker than i4-5[ )- Let (T„(t),t > 0) he Kingman's coalescent, coupled with 
(n„(t),t > 0) as in the proof of Lemma \5.'A Then there exists a constant C > such that for all 
n, we have i-*(T„(t) = n„(t) for all t) > C . 

Proof. Let Tf, = inf{t : |Tn(t)| = 6}. Conditional on n„(Tft) = T„(r;,), the probability that 
n„(t) / T„(t) for some t G [rb,r5„i] is a^/Af,. It follows that 

P(T„(t) = n„(t) for all t) = \\(l-^\. 

Note that Ob/Xh < 2ah/[b{b — 1)] for all b. By 1)5. 9() . there exists a positive integer such that 
6a;,/ [6(6 — 1)] < 1 for all b > N, and if < a; < 1 then 1 — x/3 > e~^. Putting these results 
together, we get 



P{Tn{t) = n„(t) for all t 



where the last inequality uses (|5.9|) a| 




6 Proofs of convergence theorems 

In this section, we prove Theorem 12.21 and Proposition 13.21 The proofs use Propositions 12.11 
and 13.11 in combination with the Poisson process construction of coalescents with multiple or 
simultaneous multiple collisions. 

Recall the model presented in subsection 2.1 of how the population behaves following a single 
beneficial mutation. As in subsection 2.1, assume for now that a beneficial mutation occurs at 
time 0. Let X{t) be the number of chromosomes with the favorable B allele at time t, and 
let T = inf{t : X{t) G {0, 2iV}}. Let = < 6 < '^2 < • • • be the times of the proposed 
replacements, which occur at times of a rate 2N Poisson process. Let = < ?2 ^ • • • 

the subset of these times at which the number of individuals with the favorable allele changes. As 
observed in Schweinsberg and Durrett (2004), if 1 < < 2A^-1, then P{X{Ci+^) = k + l\X{^l) = 
k) = 1/(2 - s) and P{X{i[^^) = k - l\X{^[) = k) = {I - s)/(2 - s). Thus, the number of 



19 



chromosomes with the B allele behaves like an asymmetric random walk until it reaches or 2N . 
For integers i, j, and k such that 0<i<k<j< 2N and i < j, define 

p{i,j, k) = P(inf{s > t : X{s) = j} < inf{s > t : X{s) = i}\X{t) = k), 

which is the probability that if at some time there are k chromosomes with B, the number of 
B^s will reach j before i. Using the fact that (1 — s)^" is a martingale and applying the Optional 
Sampling Theorem, we get (see also Durrett (2002) or Lemma 3.1 of Schweinsberg and Durrett 
(2004)) 



1-(1 



1 - (1 - s} 



Therefore, the probability that the beneficial mutation leads to a selective sweep is p{0, 2N, 1) = 

,/(l_(l_,)2A^). 

Lemma 16.11 below shows that the length of time that the beneficial allele is present in the 
population is only O(logA^). Since we speed up time by a factor of to define the ancestral 
process, it will follow that for large populations, on the time scale of the ancestral process the 
lineages that coalesce as a result of a selective sweep coalesce almost at the same time. It is 
well-known (see Durrett (2002)) that a selective sweep takes time approximately (2/s) log(2A^). 
However, since a beneficial mutation leads to a selective sweep with probability approximately s, 
we get a bound on E[t] that does not depend on s. 

Lemma 6.1. We have E[t] < 4(log + 1). 

Proof. FoT 1 < k <2N -1, let Sk = #{i>0: X(4') = k} and Tk = #{i > : X{^i) = k}, where 
#S denotes the cardinality of a set S. Let qk = P{X{£^'-) ^ k for all j > i\X{^'^) = k) be the 
probability that the asymmetric random walk never returns to k. Note that E[Sk\Sk > 1] = 1/%- 
Also, P{X{ii) = k for some k) = p{0, k, 1) = s/(l - (1 - s)''). Therefore, 



E[Sk]=PiSk > l)E[Sk\Sk > 1] 



We have, for 1< /c < 2A^ - 1, 



Qk 



1 



> 



2-s 
1 - s 
2-s 

1- s 

2- s 
s 



1 



p{0,k,k-l)] + 
l-(l-s)'=-i" 



gfc(l-(l-s)^) 



p{k,2N,k + l) 



1-(1 
s(l - s)''-^ 
1 - (1 - s)^ 



sY 



+ 



1 - fl - s)2iV-fc 



+ 



1 



1-(1 



+ 1 



(1-. 

s 



\2N-k 



i2-s){i-{i-sn 



(6.1) 



It follows from this result and (|6.1I) that -E[S'fc] < 2 — s for all k. Schweinsberg and Durrett (2004) 
calculated that P{X{ii+i) / X{^,)\X{ii) = k) = k{2N - k){2 - s)/{2Nf. It follows that 



E[Tk] = E[Sk\ 



{2Nf 



k{2N - k){2 - s) 



< 



k{2N - k) 
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Since — = 1/2N for all i, we have 



1 9 /V 9 

= — y EiTk] < y T7 — TT < 2y T < 4(iogA^+i), 

k=l k=l ^ ' k=l 



as claimed. □ 

We now use this result to prove part 2 of Proposition 12.11 which shows that beneficial muta- 
tions do not cause lineages to coalesce when the beneficial gene dies out. 

Proof of part 2 of Provosition JKH Suppose X{t) = and B ^ kq. Then it can not be true 
that for all t € [0,r], the n individuals sampled at time r all have distinct ancestors with the 
6-chromosome at time t. Therefore, there is an integer i with £,i < t such that one of the following 
is true: 

1. The ancestor at time of one of the individuals sampled at time r has the b allele, but the 
ancestor of the same individual at time has the B allele because of recombination. 

2. There are two individuals in the sample at time r that have distinct ancestors with the b 
allele at time ^j, but both of them have the same ancestor at time 

We now calculate the probability of these events conditional on = k, where 1 < A; < 

We assume N > 2. For a randomly chosen b chromosome at time to have a B chromosome 
as its ancestor at time ^j-i, the chosen b chromosome must be the new one born at time (which 
has probability at most 1/(2N — k) because 2A^ — k chromosomes have the b allele at time .^j), 
there must be recombination at this time (which happens with probability r), and the ancestor at 
the site of interest must be a -B chromosome (which happens with probability at most {k + \) /2N 
because X(^j_i) < fc + 1). Therefore, the probability that all three events occur is at most 
r{k + l)/[(2iV - k){2N)] < r/N^/^. Also, at most one pair of b chromosomes at time can 
have the same ancestor at time so the probability that two randomly chosen b chromosomes 
coalesce at this time is at most (^^2"'')"^ = 2/[(27V - k){2N - k - 1)] < 2/N'^. 

By Lemma lO if M is the integer such that = r, then E[M] < (2iV) [4(log TV + 1)] = 
8iV(log + 1). Since there are n individuals and (2) pairs in the sample, combining these bounds 
gives 



(6.2) 



P{X{t) = 0, X{t) < N'/^ for all t, and G ^ kq) < 87V(log N + 1) (^-^ + ^^^^ 

Note that for 1 < A; < 2iV - 1, we have 

P{X{t) = and X{t) = k for some t) < P{X{t) = 0|X(t) = k for some t) 

= 1 - p(0, 2iV, k) = l- l2lll$N ^ (1 - ^ 
Therefore, 

P{X{t) = and X{t) > N^'"^ for some t) < (1 - s)^^'\ (6.3) 



,k 
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Combining H6.2I) and H6.3|) . we get 

PiXr = and e / Ko) < (1 - sf'^' + 8iV(log iV + 1) + ^^^^ 

Part 2 of Proposition 12.11 follows because r < C log{2N) and s is fixed. □ 

We now consider our model of recurrent selective sweeps and work towards the proof of 
Theorem 12.21 We will first define a coalescent with multiple collisions. We will then show that 
this process can be coupled with the ancestral process {'^N{t),t > 0) such that, given a finite 
number of times < ui < • • • < Um , the processes agree at these times with high probability. 

Recall that Kjsi is a Poisson point process on M x [—L,L] x [0, 1] with intensity A x //tv- We 
can define another Poisson point process on [0, oo) x [—L,L] x [0,1] which consists of all the 
points {—t/N,x,s) such that {t,x,s) is a point of K]sf and t < 0. By the Mapping Theorem for 
Poisson processes (see section 2.3 of Kingman (1993)), K"^ is a Poisson process with intensity 
measure A x Nfi]\f. The points in can be ordered by their first coordinate, so we can write 
the points as {ti,Xi, Sj) for positive integers i, where < ii < ^2 < • • • a-s. Also, define to = 0. 

We now define a P^-valued coalescent process Hn = {IlNit),t > 0). Let nAr(O) be the 
partition of {l,...,n} into singletons. Given Ili^{ti) for some i > 0, we define IlN{t) for 
ti < t < tj+i in two steps. First, we let the process obey the law of Kingman's coalescent over 
the interval (tj,ti+i), meaning that each possible transition that involves the merging of two 
blocks happens at rate one. Second, let tTj+i be a random partition of {1, . . . , n}, independent of 
(njv(t), < t < tj+i), such that for an event Ai^i of probability Sj+i, we have vTj+i = kq on 
and the conditional distribution of vTj+i given ^j+i is Qp^n, where p = e~'''v(^i+i)iog(2A^)/si+i _ ■\Ye 
then define Uiyiti+i) to be the coagulation of n7v(ti+i— ) by tTj+i. 

The lemma below states that the coalescent process Hn that we have just defined is a coa- 
lescent with multiple collisions. 

Lemma 6.2. Let r]]\f be the measure on (0, 1] such that 

r]N{[y, ^ j •5l{e-'-iv(-)i°g(2^)/=>j/} NiiN{dx X ds) 

for all y € (0,1]. Let Aq^at he the measure on (0,1] such that Ao^N^dx) = x'^rj^^dx), and let 
Aat = ^0 + Aq^at. Then the process {IlNit),t > 0) is the Vn-valued A.;^ -coalescent. 

Proof. Let K'^ be the point process on [0, oo) 'xVn consisting of the points (tj, tTj). By the Marking 
Theorem for Poisson processes (see section 5.2 of Kingman (1993)), K'^ is also a Poisson point 
process. Given {ti,Xi,Si), the partition vrj has distribution Qp^m where p = e~^^'^^'^^°s*^^^-'/'*% 
conditional on an event of probability and otherwise is kq. Therefore, the intensity measure of 
K'j^ is given hy X x H, where, for tt hq, 

H{t^) = / sQ^-rj^{=o)iog(2N)/s „(7r) N^N^dx x ds) 
J-L Jo 

1 rl 

2 



Qp,n{'^) VNidp) = / Qp,n{T^)p Ao,Ar((ip). 

Jo 

By comparing this with (|3.3j) and recalling that IT follows the law of Kingman's coalescent over 
the intervals (ti_i,tj), we conclude that Iljv is the Ajy-coalescent. □ 
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The next lemma states that it is unhkely for there to be a beneficial allele in the population 
at any fixed time. Recall that T/v = {t : {t, x, s) is a point in K]sf for some x and s}. 

Lemma 6.3. There exists a constant C , not depending on N , such that for any fixed y G M, we 
have P{y G [t,T{t)] for some t £ Tn) < {C log N)/N. 

Proof. The points of T/v form a Poisson process on R of rate ^jn, where = hnH—L, L] x [0, 1]). 
Recall from Lemma [6.1l that if r denotes the amount of time for which a beneficial allele is present 
in between 1 and 2N — 1 members of the population, then E[t] < 4(log + 1). Therefore, 

/y 
P{t > y — x)^N dx 
-oo 

/•oo 

= 1N I P{t >x)dx = 77v^[r] < 47Ar(log iV + 1). 

Since the measures N^n converge weakly to /x, the sequence {Njn)n=i converges to L, L] x 
[0, 1]) and therefore is bounded. The lemma follows. □ 

We now show how to couple the processes and Iljv so that they agree at a given finite set 
of times with high probability. We first consider how the ancestral process behaves around 
the times ti,t2, ■ ■ ■ ■ For positive integers i, let Tj = —T{—Nti)/N. We have —Nti G T^r. However, 
recall from subsection 2.2 that not all points in T/v are in Tj^ because some potential mutations 
are discarded to avoid overlapping selective sweeps. When —Nti G T^, there is a beneficial allele 
in the population during the time interval [—Nti,T(—Nti)), and this affects the process over 
the interval [rj, tj]. 

For each i such that —Nti £ T^j we can define a random partition 9i G Vn by choosing n 
individuals from the population at time T{—Nti) and declaring two integers j and k to be in the 
same block of 9i if and only if the jth and A;th individuals chosen got their allele at the neutral 
site of interest from the same ancestor at time —Nti. If Tj > and the partition ^'/v(Tj) contains 
bi blocks, we can choose the n individuals at time T{—Nti) by first picking the bi individuals 
that are ancestors of the n individuals that were sampled at time zero, and then choosing the 
remaining n — hi at random. This will ensure that, for i such that —Nti £ T^ and > 0, the 
random partition ^'7v(ti) is the coagulation of ^Ar(rj) by 6i. 

Moreover, the conditional distribution of 6i given {ti,Xi,Si) and given that —Nti £ Tj(f is 
the same as the distribution of the random partition G defined in subsection 2.1, when the 
selective advantage is Si and the recombination probability is r/v(xj). Recall that when a beneficial 
mutation occurs in the population with selective advantage Si, it spreads to the entire population 
with probability — (1 — Si)"^^). Therefore, by Proposition 12. ![ the distribution of Q is 

approximately that of a random partition that has distribution Qp^m where p = e~'^^(^»)^°s(2iV)/si^ 
on an event of probability — (1 — Si)^^) and is kq on the complementary event. However, this 
is the same as the conditional distribution of vTj given (t j , x j , s j ) , except we have Si/(1 — (1 — Sj)^^) 
instead of Sj. It thus follows from Proposition 12.11 that we can couple the partitions 9i and VTj 
such that for any 5 > 0, 

PiOi / Tii and - Nti G 'T}^\{U, x,, s,)) < + l{s,<5}, (6.4) 

where Cs is a constant that depends on 6. Note that we only get the 0(l/(log A^)) bound when 
Si > 6 because of the assumption in Proposition 12.11 that s is fixed. 
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Finally, we consider the processes during the intervals The process IItv behaves like 

Kingman's coalescent during these intervals. Let 

oo 

1=1 

The process ^ n behaves like Kingman's coalescent during the intervals in (0, oo) because the 
population follows the Moran model during the corresponding intervals. Therefore, if nAr(tj) = 
^^Niti), we can couple the processes so that IlN{t) = ^'Ar(t) for all t G [U, (pi), where (pi = mi{t > 
ti : t Tjy}- 

Proposition 6.4. Suppose the processes Hn oind ^ ^ are coupled in the manner described above. 
Let < ui < • • • < Ura be fixed times. Let e > 0. For sufficiently large N , we have 

P(IlN{ui) 7^ ^'Ar(ui) for some i E {1, . . . , m}) < e. (6.5) 



Proof. Let K = supjA; : t^ < u^}. Suppose the following conditions hold: 

1. For i = 1, . . . , m, we have Ui ^ I"^. 

2. For all positive integers i, we have Tj > 0. 

3. For i = 1, . . . , K , we have —Nti G T^. 

4. For i = 1, . . . , K , we have YlNin) = IlN{ti—). 

5. For i = I, . . . , K , we have 9i = iTi. 
Conditions 2 and 3 imply that 

= to < Ti < ti < T2 < t2 < ■ ■ ■ < TK < tx < Um- 

Condition 1 with i = m implies further that tj > for all j > K, so {tx, Um] C M\Xj^. We know 
that nAr(to) = ^^Nito) = kq. Suppose, for some i G {0, . . . , K — 1}, that I[N{ti) = ^'Ar(tj). Then 
the coupling gives IiN{t) = ^'Af(t) for all t S [tj,rj+i). Condition 4 gives nAr(rj+i) = I[]\f{ti+i—). 
Conditions 2 and 3 imply that ^'7v(ij+i) is the coagulation of ^'Ar(rj+i) by 6*4+1. Since Il]\f(ti+i) 
is the coagulation of n7v(ti+i— ) by vTj+i, condition 5 ensures that nAr(tj+i) = ^7v(ij+i)- Thus, 
^n{U) = ^Af (ii) for i = 0,1, . . . , K , and the coupling combined with the fact that Il]\f(tK) = 
^Nitx) gives Il]^{t) = ^Nit) for all t G {tK,Um]- Thus, we have nAr(t) = ^Nit) for ah t G 
[0,-Um] X^N- Therefore, by condition 1, Il]\f{ui) = ^]\f{ui) for i = 1, . . . ,m. It thus remains only 
to show that conditions 1 through 5 occur with high probability. For the rest of the proof, we 
allow the constant C to change from line to line. 

If Ui G X^, then there exists t G T/v such that —Nui G [t,T(t)]. Therefore, by Lemma 16. 3| 

ClogiV 

P{ui G 2"^ for some i G {1, . . . , m}) < — — — . 

Likewise, if Tj < for some i, then —Nti < < T{—Nti) and —Nti € T/y. It follows that 
P(rj < for some i) < C {log N)/N by Lemma IH?^ 
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To deal with conditions 3, 4, and 5, let Ij = jUm/N for j = 0, 1, . . . , A^, and define the intervals 
Ii, . . . , In hj Ij = [Ij-i, Ij]- Note that the number of the points ti in an interval Ij is Poisson with 
mean Um'yN- Therefore, the probability that some point ti falls in is at most Um'jN ^ C/N. 
The probability that two or more points fall in Ij is at most u^^y'j^ < C /N"^. If there is a point 
ti G Ij with —Nti ^ T^, then either there are two points in Ij or there is one point in Ij and 
Ij € Zy. The event that there is at least one point in Ij is independent of the event that Ij G Tj^, 
so using Lemma 16.31 again, the probability that both occur is at most C {log N)/N'^. 

When the number of blocks in the coalescent is at most n, the total transition rate of the 
process Iljv is bounded by (2) + Njn- The probability that there is any point ti in Ij is at most 
C/N, so by Lemma IHTTl 

P{UN{Ti) + n7v(ti-) for some ti G Ij) <^[r\^ N-iAE\ti - n] < 



Finally, we may choose 6 small enough that P{si < 6) < e, and then (|6.4() gives 

Cs 



P{Oi + VTilti € Ij) < e + 



logiV' 



where is a constant that depends on b. Therefore, P(di 7^ vTj for some ti £ Ij) < e/N + 
C5/(A^log A^). Since there are only intervals Ij, we can add these bounds to show that the 
probability that conditions 1 through 5 all hold is at least 1 — e for sufficiently large N, which 
implies the statement of the proposition. □ 

Proof of Theorem \2.<A Let < ni < • • • < Um be fixed times. Let e > 0. Define A^r as in Lemma 
16.21 and let Hat be a Pn-valued Ajy-coalescent. In view of Proposition 16.41 it suffices to show that 
for all TTi, . . . , TTm G Pn, we have 

\P{UN{ui) = TTi for all i e {1,... ,m}) - P{Il{ui) = TTi for all i S {1, . . . ,m})\ < e 

for sufficiently large A^. Therefore (see Pitman (1999)), it suffices to show that the measures An 
converge weakly to A. Thus, we need to show (see Billingsley (1999), Theorem 2.1) that for any 
bounded uniformly continuous function h on [0, 1], we have Jq h{x) A^idx) — > h{x) K{dx) as 
A^ 00. By the definitions of Ajv and A, it suffices to show that h{x) r]i\f{dx) h{x) ri{dx) 

as A^ ^ 00 for any bounded uniformly continuous function h on (0, 1] . By the definitions of the 
measures ijn and r], this is equivalent to showing that 



lim / / sh{e-''''^''^^°^^^^^/') NfiN{dxxds)= / / sh(e-'^''^/') ij{dx x ds) (6.6) 

for any bounded uniformly continuous function h on (0, 1]. However, it is easy to deduce (j6.6j) 
from the boundedness and uniform continuity of h, the uniform convergence of (log 2N)rN to r, 
the continuity of r, and the weak convergence of the measures N ^n to /i. □ 

Proof of Proposition \S. Sl Proposition 13.21 can be proved by repeating the proof of Proposition 
16.41 with minor changes. To prove the first part of the proposition, we construct the coalescent 
process Hn as before. Because Nfj.^ = A* log{2N)rN = r for all A^, we have An = A for 
all A^. It follows from Lemma l6 . 21 that II^v is a A-coalescent for all A^. Thus, it suffices to show 
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H6.5I) . but with C/(logA^) on the right-hand side instead of e. Because we are assuming that fj, 
is concentrated on [—L,L] x [e, 1] for some e > 0, we can choose 5 = e and drop the indicator 
from the right-hand side of (|H.4|) to get a bound of C^/ (log N). We then obtain C/(log A^) on the 
right-hand side of (|6.5() by fohowing the same steps as before. 

To prove the second part of Proposition \'A.2[ we modify the definition of Hat. Conditional 
on Ai, we give vTj the distribution Q_R(rjv{a;,)/si,[2Ars,J),n- We set tTj = kq on Af. The intensity 
measure of K'j^ is then given by A x J, where, for all vr 7^ kq, we have 

Ji-^) = [ [ S<3i?(rjv{x)/s,L27VsJ),n(7r) Nfl^idx X ds) 
J-L Jo 

Qs^A'^) GN{dx) = I Q5,,„(vr)( yx'i] ENfi{dx) 



By comparing this with ()3.2() . we see that the process Hat is a Hjv-coalescent. It follows from 
Proposition 13.11 that we can replace C5/(logA^) on the right-hand side of 1)6. 4|) by C5/(logA^)^. 
This gives the second part of the proposition. □ 
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